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REMARKS 

This is an amendment after final action and request for reconsideration 
filed under 37 C.F.R. 1.116. The changes in the claims reduce the issues in any 
sut>sequent appeal procedure. The broad claim 1 has been canceled and the 
claims that depended on claim 1 have been amended, so that they now depend 
on the more limited amended daim 3. 

I. Obviousness Rejection based on Huang and Coorman 

Claims 1 to 2, 4 to 8 to 9 and 11 to 12 were rejected under 35 U.S.C. 
103 (a) over Huang, et al (referred to below as "Huang") and Coorman, et al. 
(referred to below as "Coorman"). 

Cancellation of claim 1 and the change in the dependency of the 
dependent claims so that they all depend on amended independent claim 3 has 
obviated this rejection under 35 U.S.C. 103 (a). 

II. Obviousness Rejection based on Huang, Coorman and Carter 

Claim 3 was rejected as obvious under 35 U.S.C. 103 (a) over Huang, et 
al. (refen^ to below as "Huang") and Coomnan, et al. (referred to below as 
"Coornian"). and further in view of Carter, et al. (refen«d to below as "Carter")- 
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Huang and Coorman and their relationship to the claimed inventive 
method have been discussed in the previously filed amendment 

Full credit should be given to the applicants' definition of ''multimedia data" 
on page 5 of the applicants' specification. Applicants are their own 
lexicographers (M.P.E.P. 2173.05 (a)). The term, ''multimedia data", should not 
be interpreted more broadly than the definition on page 5 of the specification, 
which is "audio-visual information" . The term "multimedia data" in the claims 
should be accorded this meaning. 

According to claim 3 the multimedia data stream includes phonetic 
translation hints that determine how respective words following the phonetic 
translation hints are converted or translated into speech by a speech synthesizer. 
The novel feature of the presently claimed method includes controlling the 
method to specify a portion of the textual description during which the phonetic 
transcription hint of a given word or text segment need not be repeated. The 
phonetic translation hints for the respective words associated with them are not 
repeated in this portion of the textual description when the words repeat 
according to the method of claim 3. 

This feature of the method of claim 3 provides a savings in processing 
yffork because the data stream is shorter, especially if the word or words provided 
with the phonetic translation hints are repeated a considerable number of times. 
This provides the flexibility that is needed so that foreign pronunciations of 
foreign words embedded in a multimedia data stream in a native language that is 
already recognized by the synthesizer can be properly pronounced, as they 
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would by a foreigner speaking that language (see page 15 of applicants' 
specification). 

Huang and Coorman do not disclose or suggest this distinguishing feature 
of the method of claim 3, a fact that is recognized in the fj n ^ l Office Action 
because Carter is needed to allegedly suggest this feature. However applicants 
respectfully submit that Carter does not suggest all the modifications of the 
subject matter of Huang and Coorman, which are necessary to arrive at the 
inventive method as claimed In claim 3. 

Carter discloses a method and apparatus and computer program for 

reducing load on a text-to-speech converter in a message system capable of 

text-to-speech conversions of E-mail documents (title, abstract). Column 4, linesi 

to 23, does disclose a converter apparatus with a cache memory that stores 

certain text segments (words) of a received E-mail message in a cache memory 

along with the converted speech signals for those text segments. Then when 

playback of other E-mail messages is requested jf the other E-mail messciges 

contained a text segment (word) that is stored in the cache memory along with 

the corresponding speech signals, the text-to-speech conversion process for that 

text segment (word) is by-passed and instead the stored speech signal is 

retrieved. For example, see figure 3 and the description associated with it, 

especially column 3, line 67 to column 4, line 10, which states: 

''Upon a request by a user to convert the text segments of a new e-mail 
message to speech signals for playback via a telephone handset 34, a 
controller 42 determines whether any of the text segments of the new 
e-mail message are identical to previously converted text segments for 
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which speech signals are already stored in the cache 40. If so, the stored 
speech signals of those text segments are played back to the user from 
the cache, thus avoiding the need for the text-to-speech converter to 
convert those test segments of the new e-mail message to speech." 

There are advantages to associating a respective phonetic translation hint 
with a particular word or text segment during a portion of the data stream Instead 
of storing the speech signals associated with the word, although then it is 
necessary to repeat the speech synthesis or conversion process each time the 
word or text segment is repeated. However in the method of claim 3 each time 
the word is repeated it Is not immediately preceded by the associated phorietic 
translation hint. This significantly shortens the data stream in the case of words 
which are repeated a large number of times and provides the flexibility to cope 
with special cases where automatic transcription is not applicable (see 
applicants' speciftcation on page 3, lines 13 to 16). 

Carter cleariy does not suggest associating a particular phonetic 
translation hint with a given word or text segment over a certain portion of the 
text or data stream. Instead actual speech signals are associated with the given 
word. These actual speech signals are stored in memory so that they are 
retrieved and played if the word is repeated, as shown by the quotation from 
column 4 of this reference. 

However the method of Carter is inefFicient arKi consumes much memory 
or requires longer time intervals, because of the complexity of digital or analog 
speech signals necessary for faithful sound reproductbn, since large amounts of 
data must be retrieved and/or stored. 

8 



PA(X 9f33 ' RCVD AT 1 1»S 4:24: 13 PM [Eastern Stands 



' 11/03/2065 17:24 TEL 1 631 549 0404 STRIKER @ STRIKER US PTO 



©010/033 



Thus Carter only suggests storing speech signals or associating a 
respective speech s^nai with a particular word so that repetition of the 
conversion process Is unnecessary vvhen the word is repeated. Carter does not 
disclose or suggest associating a respective phonetic translation hint with a 
particular word, so that the translation hint does not need to be repeated, as 
claimed in claim 3. These are entirely different methods and the method of Carter 
would not suggest the method of claim 3. Carter does not suggest all the 
modifications of the subject matter of Huang or Coorman necessary to arrive at 
the method as claimed in claim 3. 

Also, the method of applicants' claim 3 requires that the previous phonetic 
transcription hint is valid for a defined portion or all of the textual description in 
the data stream (claim 3). No storage and no retrieval of speech signals or any 
other data in and from a memory are necessarily required in the claimed method 
of applicants' claim 3 In contrast to the method of claim 5 of Carter, which is 
described in part in column 4. For example, the data stream could include 
control characters (see page 11 and 13 of applicants' specification) that define 
the portion of the data stream over which repetition of the hint is not necessary 
so that It would not be necessary to store the hint itself. 

in addition, because storage and retrieval of speech signals in a memory 
is required in the method of Carter, the savings due to the reduction in word-to- 
speech signal conversion work when text segments are repeated is counteracted 
by the time required for storage and retrieval of the speech signals. The 
embodiments using the cache memory (for speed) are particulariy limited, as 
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explained in column 4. lines 13 to 23. of Carter, because the cache can only 
function w ith repeated text segments that have 40 or fewer characters. There are 
no such limits due to storage and retrieval work that are necessarily present in 
applicants' method of claim 3. The phonetic translation hint could in prindpte be 
larger than 40 characters and apply to a large text segment, for example, an 
entire sentence or phrase in a foreign language. 

It Is well established by many U. S. Court decisions that to reject a 
claimed invention under 35 U.S.C. 103 there must be some hint or suggestion In 
the prior art of the modifications of the disclosure in a prior art reference or 
references used to reject the claimed invention, which are necessary to arrive at 
the claimed invention. For example, the Court of Appeals for the Federal Circuit 
has said: 

"Rather, to establish obviousness based on a combination of eiemerrts 
disclosed in the prior art, there must be some motivation, suggestion or 
teaching of the desirability of making the specific combination that was 
made by the applicant... Even when obviousness is based on as single 
reference there must be a showing of a suggestion of motivation to modify 
the teachings of that reference.." In re Kotxab, 55 U.S.P.Q. 2"^ 1313 
(Fed. Cir, 2000). See also M.P.EP. 2141 

The term "phonetic translation hint" cannot be reasonably interpreted as 
including ''speech signals^ Speech signals are the resulting signals output from a 
speech synthesizer conversion or translation process. However "phonetic 
translation hints" are data, which is input to a speech synthesizer or converter to 
produce the resulting speech signals. An output signal from a special processor 
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cannot be held to be the same as the input signal. 

Thus Carter does not reasonably suggest storing and retrieving phonetic 
franslation hints for a particular word in a portion of a data stream so that 
repetition of the translation hint Is not required when the word is repeated. Carter 
would only suggest storing the resulting speech signals of a word so that the 
conversion process does not need to be repeated or associating the converted 
speech signals with a particular word so that the conversion Is not necessary 
when the word Is repeated. The one method is not obvious from the other. 

For the foregoing reasons and because of the changes in claim 3, 
withdrawal of the rejection of claim 3 as obvious under 35 U.S.C. 103 (a) over 
Huang, et al, and Coomian, et al, and further in view of Carter, et al» is 
respectfully requested. 

Similarly for the same reasons amended dependent claims 2, 4 to 6, 8, 9. 
1 1 and 12 should not be rejected under 35 U.S.C. 103 (a) over Huang, et al, and 
Coorman, et al, and further in view of Carter, et al. 

III. Obviousness Rejection based on Huang, Coorman and Sharman 

Claims 7 and 10 were rejected as obvious under 35 U.S.C. 103 (a) over 
Huang, et al. (refen^ed to below as ''Huang'') and Coormann, et al, (refen-ed to 
below as "Coormann"). and further in view of Shamian, et al (referred to below as 
"Sharman"), 
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The features of claims 7 and 10 are currently not being relied on to 
establish patentability of the claimed method, but the dependent claims have 
been amended so that they now depend on claim 3. Instead these features are 
features of preferred embodiments of the amended method claim 3. 

For the foregoing reasons withdrawal of the rejection of claims 7 and 10 
as obvious under 35 U.S.C. 103 (a) over Huang, et al, and Coorman, et al, and 
further in view of Sharman, et al, is respectfully requested. 

Should the Examiner require or consider it advisable that the specification, 
daims and/or drawing be further amended or corrected in formal respects to put 
this case in condition for final allowance, then it is requested that such 
amendments or corredions be carried out by Examiner's Amendment and the 
case passed to issue. Alternatively, should the Examiner feel that a personal 
discussion might be helpful in advancing the case to allowance, he or she is 
invited to telephone the undersigned at 1-631-549 4700. 

In view of the foregoing, favorable allowance is respectfully solicited. 



Respectfully submitted. 




Attomey for the Applicants 
Reg. No. 27,233 
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